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0^ ■ Abstract 

We study random spatial permutations on 1? where each jump x n- 7r(a;) is penahzed 
by a factor q~'^«^~'^w« . The system is known to exhibit a phase transition for low 
r^ ■ enough T where macroscopic cycles appear. We observe that the lengths of such cycles 

O , are distributed according to Poisson-Dirichlet. This can be explained heuristically us- 

^ ■ ing a stochastic coagulation-fragmentation process for long cycles, which is supported by 

P^ ' numerical data. 
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1 Introduction 



C^ ■ Random permutations arc comm.on in probability theory and combinatorics [1]. They also 

U , occur in statistical mechanics, albeit with an additional spatial structure. With A denoting a 

I ■ finite box in 1? ^ we consider the set 5a of permutations of A, i.e., the set of bijections A — > A. 

^ , The probability of a given permutation tt G iSa depends on the jump lengths in such a way that 

all sites are mapped in their neighborhoods. In this paper, we study the model with probability 



-'- ^-(--E ' 



PaW = — exp(^-r}_^||a--7r(x-)rj . (1) 

^ ■ Here T > is a positive parameter and ||a; — 7r(a;)|| denotes the Euclidean distance between x 

\f~^ , and 7r(.T). The normalization Za is defined by 
^H . , . 

(N; Za= ^ exp(-T^||.T-7r(a;)||2j . (2) 

•^ ! -neSK xeh 

v^ \ This model has its origin in Feynman's approach to the quantum Bose gas [13], where T is 

^-^ ■ proportional to the temperature. Bosons are described by Brownian trajectories with ^ playing 

the role of time; this suggests the weights ([T|) for the permutation tt. The presence of a lattice 
is not really justified, but it does not affect the qualitative behavior, at least in dimension 3 
(we comment on dimension 2 at the end of the article) . 
S^ ■ Let us understand the qualitative behavior of the model when we vary the parameter T . The 

5-H \ most probable permutation is the identity, which has weight 1. Typical permutations should 

be close to the identity when T is large, with small cycles here and there. The weight in ([1]) 
penalizes large jumps and we expect that ||a; — 7r(a;)|| < -^ . As T decreases, sites are allowed 
to be mapped to more locations and the lengths of permutation cycles grow. One expects that 
a phase transition takes place (in dimension 3 or more) that is accompanied by the occurrence 
of infinite cycles. See Fig. [T]for a schematic spatial permutation with small T. 
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Figure 1: A typical spatial permutation with periodic boundary conditions for small T. The cycle 
that contains the origin is depicted in black and it may be long. Isolated sites belong to 1-cycles (i.e., 
they are mapped onto themselves). 



The phase transition was observed numerically in [TSl [T7]; it takes place at T^ ~ 1.71. 
The fraction of sites that belong to macroscopic cycles was seen to converge to a non-random 
function ^oo(T') as |A| — > oo, which is continuous and monotone decreasing in T. There are 
many macroscopic cycles and their sizes fluctuate. It was also observed that the average length 
of the largest cycle scales like 0.624j/oo(T)|A|, which is identical to the expectation of the 
largest cycle in a random permutation with uniform distribution |23| . The latter observation 
was unexplained and puzzling at the time. 

The situation is understood better now, and the explanation turns out to be surprisingly 
general. The joint distribution of the length of the long cycles is given by the Poisson-Dirichlet 
distribution. This distribution has been introduced by Kingman in J18j and has cropped up in 
combinatorics, population genetics, number theory, Bayesian statistics and probability theory, 
see [121 HH m Uni mi for details of applications and extensions. 

Occurrence of the Poisson-Dirichlet distribution in models of statistical mechanics, i.e. in 
models with spatial structure, seems to have been noticed only recently. It has been rigorously 
established in the "annealed" model of spatial permutations where the locations of the sites 
are averaged upon ^. The proof is inspired by |24| and it uses a representation in terms of 
occupation numbers of Fourier modes, and non-spatial permutations within the modes. Such a 
structure is not present here, however. 

We recall the definition of the Poisson-Dirichlet distribution in Section 12.21 and provide nu- 
merical evidence in Section [2. 31 that it is present in our model. In order to explain this, we show 
that the equilibrium state can be viewed as the stationary measure of an effective split-merge 
process. This strategy was recently applied successfully by Schramm to the random interchange 
model on the complete graph 22 (the result was first conjectured by Aldous, see [Bl). The 
absence of spatial structure makes the situation much simpler, but it was nonetheless a tour de 
force to prove that long cycles occur, that they satisfy an effective split-merge process, and that 
their asymptotic distribution is Poisson-Dirichlet (see also [51 for subsequent simplifications and 
improvements). This strategy was also devised in |16j for the cycles and loops that arise in 
the Toth and Aizenman-Nachtergaele representations of quantum Heisenberg models in three 
spatial dimensions [25l[2l. It allowed in particular to identify the parameter of the conjectured 
Poisson-Dirichlet distribution. Further situations that look similar include the random currents 



in the classical or quantum Ising models pQ [TUl [H] . 

The key features are as follows: Long cycles are one-dimensional macroscopic objects and 
they are spread uniformly over the whole space. Introducing a suitable stochastic process 
with local changes, we observe that cycles are merged at a rate proportional to the number 
of "contacts" between them, and this number is proportional to the product of their lengths. 
Cycles are split at a rate proportional to the number of self-contacts, which is proportional to 
the square of the length. This is exactly analogous to a split-merge process on interval partitions 
pn 131 [7] . As a consequence, the distribution of cycle lengths at equilibrium should be given 
by the invariant measure of the split-merge process, which is known to be the Poisson-Dirichlct 
distribution. 

This explanation seems very attractive but it glosses over many technicalities. It assumes 
that a spatially uniform distribution of long cycles leads to a "mean-field" interaction and the 
correlations due to their spatial structure can be ignored. We provide mathematical background 
for these ideas in Section [XTI and this allows us to state precise conjectures in Section lX^ These 
conjectures are confronted with numerical results in Section 13.31 As it turns out, the above 
heuristics is fully confirmed. 

2 Distribution of long cycles 

2.1 Nature of long cycles 

Let us first give precise definitions for "macroscopic" , "mesoscopic" and "finite" cycles in the 
infinite volume limit. Given a; G A and a permutation tt € S\, let Lx{tt) denote the length of 
the cycle that contains x, i.e., the number of sites in the support of this cycle. 

• Macroscopic cycles occupy a non-zero fraction of the volume. The fraction of sites in 
macroscopic cycles is given by 

J^macro(r) = lim lim iuf ^Ea f #{x G A : L, > £|A|}). (3) 

e->0+ |A|-)-oo |A| \ / 

• Mesoscopic cycles arc infinite cycles that are not macroscopic. The fraction of sites in 
mesoscopic cycles is given by 

i^n^csoiT) = lim liminf ^Ea (#{x e A : K < L, < ^}) . (4) 

K^oD |A|-s-oo |A| V K y 

• Finally, the fraction of sites in finite cycles is given by 

i^fi„ito(r) = lim liminf -^eJ#{x e A : L, < K}) 

A^-oo |A|-i.oo |A| \ / (5) 

= 1 - Uoo{T). 

Here, Voo{T) = t'mcso(T') + t'macro(r) is the fraction of sites in infinite cycles. 

One expects that only finite cycles are present when T is large, that a phase with macroscopic 
cycles is present when T is smaller than a positive number T^. This was proved in the annealed 
model in [HI [3] . We check this numerically in the lattice model. Let 

( #{xeA:L,,<\A\-} ^ 
P|A|(a)=EA(^ r^] j- (6) 
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Figure 2: Plots of the expected fraction of sites p|A| (a) in cycles of length smaller than or equal to |A|". 
The horizontal dashed line indicates 1 — Voo(T), the fraction of particles in finite cycles in the infinite 
volume limit. The curves have an intersection point independent of |A| at a ~ 0.6 which is therefore 
used as the cutoff to distinguish long and short cycles. !/oo(0.8) is estimated to be 0.292. Averages were 
taken over 5 x 10* realizations. 

denote the fraction of sites that belong to cycles of length less than or equal to |A|°. Notice 
that p|A|(0) is the fraction of particles mapped onto themselves and that p|A|(l) = 1- Numerical 
results for p|A|(a) are depicted in Fig. [2] for various parameters T. 

In a finite domain, we need to define the cutoff that separates finite and "infinite" cycles. 
We can choose |A|" with any power < a < 1, since in the infinite volume limit, Poaifl) will 
not depend on our choice. The graphs of ptSa) depend on the size of the domain, but we see 
in Fig. [5] (d) that they cross the same point around a « 0.6 for T < T^. This value of a is 
approximately independent of T and we choose it for the cutoff, since it significantly reduces 
finite size effects. For the numerical results, we define the fraction v\^\ of sites that belong to 
infinite cycles by 

i.|A|(T) = l-p|A|(0.6). (7) 

In accordance with results for the annealed model [HI El we expect that i^moso(2^) = and that 

lim I^|A|(r) = l^oo(r) = l'macio(^)- (8) 

|A|-i.oo 



This is supported by the numerics in Fig. [2] 



2.2 GrifRths-Engen-McCloskey and Poisson-Dirichlet distributions 

For a given permutation tt G iSa wc call the cycle at x with length Lx(tt) macroscopic if 
Lx{t^) > lAj"'^, as discussed in the previous section. Let L''^^(7r),L(^)(7r), . . . ,L(*''^(7r) denote 
the cycle lengths in decreasing order, where L^^^ is the smallest macroscopic cycle for the 
permutation tt. If A*-*' — -ttt- is the fraction of the sites in the ith macroscopic cycle, we define 

i.W:=A(i)+... + A('=' (9) 

to be the fraction of sites in macroscopic cycles, and have Ea(i^) = i^|a| (T). The sequence (A^'') 
forms a random partition of the (random) interval [0,1^(71)]. We now introduce the relevant 
measures on such partitions, that will allow us to describe the joint distribution of cycle lengths. 
The Poisson-Dirichlet distribution (PD) is a one-parameter family but we only need the 
distribution with parameter 1, so we ignore the paramter altogether. It is best introduced 
with the help of the Griffiths- Engen-McClosckey distribution (GEM). The latter is also called 
the "stick-breaking" distribution. One can generate a random sequence of positive numbers 
(Ai, A2, . . . ) such that J^i Ai = ;/ as follows: 

• choose Ai uniformly in [0,;/]; 

• choose A2 uniformly in [0, v — Ai]; 

• choose A3 uniformly in [0, v — Xi — A2]; 

• and so on..., always chopping a piece off the remaining portion of the "stick". 

This is equivalent to choosing a sequence of i.i.d. random variables (ai, a2, . . . ) where each a^ 
is taken uniformly in [0, 1], and then to form the sequence 

(ai, (1 - ai)a2, (1 - ai)(l - a2)a3, . . .) x v. 

Our goal is to recognize that a given sequence has the distribution GEM. One can invert the 
above construction, and form an i.i.d. sequence out of a GEM sequence. Namely, if (Ai, A2, . . . ) 
is GEM on the interval [0, v], the following sequence is i.i.d. with respect to the uniform distri- 
bution on [0, 1]: 

(ai,a2,a3,...) = (— , ^, — ^ — ^'■■■)- (10) 

PD is a distribution on ordered partitions, and a PD sequence can be obtained by rearranging 
a GEM sequence in decreasing order. On the other hand, given an ordered PD sequence, a 
GEM sequence can be obtained as a size-biased permutation of that sequence [H] . 

2.3 Numerical observations of cycle lengths 

The GEM distribution is easier to handle than the PD distribution, and it contains more 
information. We thus introduce an order on cycles allowing us to establish an order for the cycle 
lengths. This can be done as follows. First, choose an order for the sites of A. Then order the 
cycles according to the smallest sites in their support; namely, given two cycles 7 = (xi, . . . , x\^\) 
and 7' = {x\^ . . . , xj ,,), we say that 7 < 7' if and only if mini<j<|..y| Xi < mini<j<|..j,/| x^. We 
then denote Li, L2, . ■ ■ the lengths of cycles larger than |A|"'^ in this order, which is not to be 
confused with the notation L^ for lengths of cycles rooted in a; e A. 



Let V = v{n) for a given permutation. Our aim is to show that (— m-j "TaT"') converges to 
GEM as I A| — )• oo. as iUustratcd in Fig. |31 This is equivalent to showing that 

converges to a sequence of i.i.d. uniform random variables in [0, 1], sec Eq. (1101) . 

Li/|A| L2/IAI L3/IAI ... 



V finite cycles 

Figure 3: The lengths of macroscopic cycles, divided by the volume, give a random partition of [0, v\ 
which is expected to follow the GEM distribution. 

The Cumulative Distribution Function for Ui is defined as 

F^^{s) = P{a,<s). (12) 

Numerical plots of F^. for the first three cycles can be found in Fig. H) They clearly point to 
uniform random variables. Covariances can be found in Fig. [SJ showing that they do indeed 
tend to in the infinite volume limit. The discontinuities at s = 1 in Fig. |4] for i~ 1,2,3 are 
due to the fact that in finite volumes it may happen that only 0,1,2 cycles larger than |A|°-^, 
respectively, are present. 

2.4 Markov chain Monte-Carlo 

To sample spatial permutations we use a Markov chain Monte-Carlo process which is crgodic 
and has Pa as its unique stationary distribution. Let _Ba denote a suitable set of bonds, i.e., a 
set of unordered pairs {x, y} of sites x,y £ A. Let Txy = Tyx denote the transposition of x and 
y. We say that two permutations 7r,7r' G 5'a are "in contact", noted tt ^ tt', if there exists a 
bond {x, y} G -Ba such that tt' = tt o r^y Let Q{'k, tt'), tt, tt' e 5a, be the transition matrix of 
a continuous-time Markov chain (714 : t > 0) on 5a. 

Proposition 2.1 Suppose that B\ is large enough so that the graph {A,B\) is connected, and 
that (5(7r,7r') > whenever tt '^ tt'. The Markov chain with transition rates Q is ergodic. 

Proof. This is done in [T7], but we recall the argument here. The space 5a of lattice 
permutations is finite and irreducibility of the chain implies ergodicity. It then suffices to show 
that for all tt, tt' g 5a, tt' 7^ tt there exists n < 00 such that Q"'{tt, tt') > 0. Every permutation 
TT can be represented by a composition of transpositions. We still need to show that each of 
these transpositions can be written as a composition of transpositions along bonds of B\. 

Since the graph (A,_Ba) is connected, there exists a connected path {xo,xi, . . . ,Xm) such 
that xo = X, Xm — y, and {xi-i,Xi} G B\ for all 1 < * < "^^ One can check that the following 
composition gives T^^y: 



'X^y 'Xq^Xi ^ ^ ' Xm-2^Xm-l ' Xm-l,Xr 



O ■ ■ ■ O T^ 



(13) 



This shows that every tt G 5a is connected to the identity permutation under the Markov chain 
dynamics. D 
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Cumulative distribution functions Fq;(s) H12|) with i = 1,2,3 for system sizes |A| = 
28^, 256^ and T = 0.8. As tlie volume tends to infinity, 
random variable. Averages were taken over IC realizations. 



128^, 256^ and T = 0.8. As the volume tends to infinity, they converge to the CDF of 



The composition of tt with a transposition is explicitly given by 

{7r(z) xiz^x.y, 
TT{y) ifz = x, 
tt{x) \i z = y. 

Let Ha{tt) denote the "energy" of tt e 5a, 

H^{7:)^TJ2\\'^-A^)r ■ 
xeA 



(14) 
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Figure 5: Covariances of ai,a2,a3 as given in (jlip shown for different system sizes |A| 
32^ 64^ 128^ 256=^ and T = 0.8. 



Here the Euclidean distance |j • || is measured with periodic boundary conditions on a regular 
box A G Z'^. The distribution ([T|) then assumes the familiar form of the Gibbs state e~^'^^'^'' /Z\. 
For the transition rates we choose 



Q(^,7r') = ^^-I 



i(l, c-(^A(^')-ffAW) ) 



if Tt' ~ TT, 

otherwise. 



(16) 



Note that all rates are in [0, 1] and can therefore be used as acceptance probabilities for the 
standard Metropolis algorithm: pick a bond {x, y} S Ba uniformly at random and swap the 
images of x and y under tt with probability 1 if this lowers the energy, and with probability 



-{HAiTv')-HAM) 



< 1 if the swap increases the energy. 



It is clear that the measure Pa fulfills the detailed balance conditions, since 



-Ha{i 



)QK7r')-e-^^("')Q(^',^) 



(17) 



for all 7r,7r' € Sa. This implies stationarity. 

The particular algorithm we use is the "swap only" method described in [l7]. The initial 
permutation is set to be the identity. The Metropolis steps are then as follows: 

• Choose a bond {x, y} of nearest-neighbors at random (we use periodic boundary condi- 
tions). 



The candidate permutation, tt' = iroTxy, replaces tt with probability min(l, i 



-{Ha(^'}-HaM) 



The Metropolis step is then computationally fast since H\ (tt o t, 
on local terms: 



xy 



— H\{t:) depends only 



HAiTT o T,y) - HAin) = T{ \\x-niy)r + lly-^^)!!' - W^'^i^W ' Wv^iyW 



(18) 



We use a standard approach based on the ergodic theorem for sampling, where we let 
the system equilibrate for a number of Metropolis steps of order 10'^|A|, and ensure that our 
measurements are spread over 5 x 10^|A| steps. We have strong numerical evidence that the 
equilibration time of relevant observables is indeed of order |A|, as is shown in Fig [HI 
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Figure 6: Expected value of the fraction of particles in macroscopic cycles for the permutation after 
t Metropohs steps starting from the identity permutation, for |A| = 16^, 32^, 64^, 128^ and T = 0.8. 
Rescaling time by t4t shows the asymptotic behavior, showing that the equilibration time is of order 



|A|. The dashed line indicates the asymptote for the curves, i/oo(0.i 
realizations. 



Averages were taken over 10* 



2.5 Periodic boundary conditions 

We use periodic boundary conditions where A C Z'^ is a S-dimensional torus with equal side 
lengths L. This has the advantage of having less finite size effects than other choices such as 
closed boundary conditions. In the limit |A| — > oo we expect our results not to depend on that 
choice. 

Precisely, for i/ G Z'^ wc define Vi{y) to be the zth component of y modulo L, in such a way 
that Vi{y) G { — -^^ + 1, . . . , -^^l (we assume here that L is even, but the modifications for odd L 
are straighforward). The Euclidean distance on WA is then replaced by 



12/11 



iX.\<y)\") 



1/2 



(19) 



i=l 



Permutations on the torus can be charaterized by their winding number^ which is in reality 
a winding vector. The winding number of -k in the ith direction, i = 1, . . . , d, is the integer 



Wi{ti) = 7 X! "«('^(^) ~ ^)- 



(20) 



xGA 



In a large box, one should not expect any jumps of order L for positive T > because of 
the Gaussian weights ((T)) . The dynamics restricted to such permutations conserves the winding 
number: 



Proposition 2.2 Suppose that the permutation tt satisfies 

max ||7r(x) — x\\ < k — 2. 
xetv 

Then Wi{-K o Txy) = Wi{'K) for all i and all pairs (x, y) of nearest-neighbors in A. 



Proof. Using ([TI)) . we have 

Wi{TTOTr,y) - Wi(7r) = j;V^{TT{y) -X) + j;Vi{TT{x) ~ v) - J^V^{'K{x) - x) ~ 7;Wj(7r(z/) - y) . (21) 

Because of the modulo operation, we have 

Vi(Tr(ii)-x) = ['K{jj)-x]i + kiL, Vi{-K{x)-x) = ['K{x)-x\i + k^L, 
ViiiT^x) -y) = [tt{x) - y]t + k2L, Wi(7r(z/) - y) = [7r(y) - y]i + k^L. 

Here, [-Ji denotes the ith coordinate of the vector in Z"^, and we always have ki e { — 1, 0, 1}. It 
follows from the assumptions that fci = ki and fe2 = k^, so that (j2ip vanishes. D 

The consequence of this proposition is that we effectively lose ergodicity in very large sys- 
tems, since the dynamics conserves the winding number on simulation time scales. On the 
other hand, when macroscopic cycles are present, we expect nonzero winding numbers to ap- 
pear with positive probability in the equilibrium measure. The dynamics always start with 
the identity permutation which has zero winding number. By the proposition, the path to 
nonzero winding numbers must cross bottlenecks, i.e, permutations with large jumps which 
occur with probability less than i e^^(~^^^" . A big part of the phase space is not explored by 
the dynamics. 

By introducing Monte-Carlo transitions that flip the orientations of cycles (which does 
not change their probability), it is easy to move between permutations with even winding 
numbers. On the other hand, it would be interesting to study the winding numbers of typical 
permutations at equilibrium, but this seems to be a very difficult task numerically since it 
requires dynamics that can move between odd an even winding numbers and still sample from 
the correct distribution. 

Due to the metastability of the winding number, the actual mixing time of the Monte-Carlo 
dynamics is (at least) e'^^ with c > 0. However, since we are only interested in cycle lengths 
and not in their orientation, we do not expect this to be relevant for the observables discussed 
in the present paper. In fact, as is shown in Figure IHl and later in Figure 1131 we observe 
convergence on time scales of order |A| = L^. This is still much faster than processes such as 
card shuffles leading to uniform distributions on permutations (see e.g. j27p. which are typically 
of order |Ap with logarithmic corrections. This is due to the fact that for positive temperature 
T the stationary distribution ([Ij is not uniform, and jumps in a typical permutation are local 
of order 1/vT. Furthermore, the identity permutation wc start with is actually the ground 
state (permutation with highest probability) of that measure. 

3 Effective split-merge process 

In the previous section wc presented numerical evidence that the lengths of macroscopic cycles 
are distributed according to the Poisson-Dirichlet distribution. The goal of the present section is 
to explain it with a split-merge process, defined below in Scction lXTl More precisely, the Markov 
chain Monte-Carlo process of Section 12.41 when restricted to the cycle structure, becomes an 
effective split-merge process with the correct rates. We formulate precise conjectures about 
macroscopic cycles in typical spatial permutations, which we then test numerically. 

3.1 Split-merge process 

Recall that a partition A = (A^^^ A'^^ . . . ) of the interval [0,1^] is a sequence of decreasing 
positive numbers such that ^^ A'*' = jy. Here, i^ is any positive real number. The split-merge 
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Figure 7: Illustration for the split-merge process. The partition undergoes a merge followed by two 
splits and another merge. 

process (A(i) : i > 0), also called coagulation-fragmentation, is a continuous-time stochastic 
process on partitions where the ith and jth components (i 7^ f) merge with rate 



and the ith component is split uniformly into two parts with rate 



(23) 



(24) 



Note that the rates are in [0, 1] and they add up to 1, so they can be used directly for the 
following implementation of the process. If A(i) = (A^^^ (i), A^^^ (i), . . . ) denotes the partition at 
time i, one chooses the new configuration A(i + Exp(l)) after an exponential waiting time with 
rate 1 as follows: 

• Choose a first part of the partition with probability proportional to its size. That is, the 
index i is chosen with probability \''^\t)/v. This is called "size-biased sampling". 

• Choose a second part in the same manner, independently of the first. Let j the corre- 
sponding index. 

• Hi ^ j, merge A^*-'(t) and A^-' -'(<). That is, the partition X{t + Exp(l)) contains all parts 
\'-^\t) with k ^ i,j, and a part of size X^'\t) + X^^'){t). 

• If i = j, split A^*-'(t) uniformly. That is, the partition A(i -|- Exp(l)) contains all parts 
A('^^(i) with fc 7^ i, and two parts uA^'^(i) and (1 — u)X^^\t), where m is a uniform random 
number in [0, 1]. 

• The sequence is rearranged so that (A^'^-'(i + Exp{l))) is decreasing. 

The process is illustrated in Fig. [71 Additional background can be found in [31 [7]. Tsilevich 
showed that the Poisson-Dirichlet distribution is invariant for the split-merge process j26| . It 
was proved in |11| that it is the unique invariant measure (see also [22]). 

A key property of our Monte-Carlo process (tt^ : t > 0) described in Section [2^ is that, at 
each step, either a cycle is split, or two cycles are merged. This is illustrated in Fig. [SI Let us 
state this precisely. 

Proposition 3.1 Let tt E S\ and x,y E A with x ^ y. 
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Figure 8: The transposition r^y splits a cycle if x,y belong to the same cycle (left), or it merges two 
cycles if x,y belong to distinct cycles (right). These are the only two possibilities. 



• If x,y belong to the same cycle in tt, then x, y belong to different cycles m tt o r^y. 

• If x,y belong to different cycles in tt, then x, y belong to the same cycle in tt o r^y 

See [17] for more details. It is clear that all cycles of tt that do not involve x or y are also 
present in tt o Txy, and reciprocally. The length of the coalesced cycle is equal to the sum of the 
lengths of the two original cycles, and similarly for a fragmentation. 

3.2 Effective split-merge process for macroscopic cycles 

We have seen in the previous section that each Monte-Carlo step results in either splitting a 
cycle, or merging two cycles. We have also seen in Section 12.11 that two kinds of cycles are 
present: The finite cycles, whose lengths do not diverge in the thermodynamic limit. And the 
macroscopic cycles, whose lengths are positive fractions of the volume. A Monte-Carlo step 
does one of the following: 

(a) Merge two finite cycles. 

(b) Merge a macroscopic cycle and a finite cycle. 

(c) Merge two macroscopic cycles. 

(d) Split a finite cycle (resulting in two finite cycles). 

(e) Splits a macroscopic cycle, resulting in a finite and in a macroscopic cycles. 

(f) Splits a macroscopic cycle, resulting in two macroscopic cycles. 

One expects each of these options to take place with rates of order 0(1) in the limit |A| — )• oo. 
The ones that are relevant to the effective split- merge process are (c) and (f), since their effect 
is reflected in a change in (A(^^(7r(i)), A(^)(7r(i)), . . .), the ordered lengths of macroscopic cycles 
normalized by |A|. Accordingly, we introduce the rate Rij at which the ith and jth largest 
cycles merge. It depends on the permutation tt, and, with i ^ j, it is given by 

i?„-(7r) = 2 Y. J2 Qi^:^°rxy)- (25) 

Notice that RijijT) scales to a constant as the volume diverges, since the sum over x,y is of 
order |A|, and Q{-) is of order l/|i?(A)|, see Eq. p6p . The rate Ri at which the ith largest cycle 
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splits into two macroscopic cycles involves the cut-off K that distinguishes finite vs macroscopic 
cycles: 



x<z^^{i) k=K 



(26) 



Here, 7r'^(x) is the site at "distance" k of x along the cycle 7'-*'. If we set X = 1 in the right 
side, we get the rate at which 7'') splits, irrespective of the sizes of the resulting cycles. The 
expression above gives the rate at which 7*^*) splits in two cycles, each of which has length 
greater than K. As in previous sections we use the cutoff K = |A|°-^. 

We expect that, for almost all permutations in equilibrium, the rates Rtj and Ri are equal 
to those of the split-merge process modulo a constant time-scale R, resulting from the effective 
rate of macroscopic processes (c) and (f) above. Recall that A^'^(7r) = L(*)(7r)/|A| is a random 
variable. 

Conjecture 1 Let T such that Voo{T) > 0. There exists a number R (that depends on T but 
not on indices) such that for all i,j, and all e > 0, 



lim 

lAKoc 

lim 

lAKoc 



\Ri 



\Ri 



-2A«A(j)i?| >£) =0, 
(A('))2i?| > e) = 0. 



The time scale of the effective split-merge process is determined by R, but the invariant 
measure is not, so the exact value of R is irrelevant. It is important, however, that it is 
identical for both the splits and the merges, and for all macroscopic cycles. 




-^ y 



finite cycles 



Figure 9: Schematic drawing of the situation in a mesoscopic box. It contains finite cycles in light 
color, and two legs that belong to each of the macroscopic cycles 7 and 7'. The probability that 7 and 
7' merge in the next step is proportional to the number of 'contacts' between them, which in turn is 
proportional to L-yL^i. The probability that 7 splits into two macroscopic cycles is proportional to the 
self contacts amongst different legs of 7, which is in turn proportional to L^. 



Let us explain the heuristics towards this remarkably simple behavior. Consider a meso- 
scopic box A' whose size is large enough so that boundary effects are irrelevant, yet small 
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enough so that A is made up of a large number of mesoscopic boxes. The restriction of tt on 
A' gives many finite cycles, and open legs that are parts of macroscopic cycles. See Fig. [HI 
for a schematic picture. Let us choose a pair of nearest-neighbors x, y at random, with the 
condition that x, y belong to distinct legs. The probability that r^y merges 7^'^ and 7*-^^ is 
equal to the probability that x belongs to 7'-'^ and y belongs to 7*^^^, or conversely, which is 
equal to 2\^'^'^ \^^'^ / v{T)'^ ^ up to vanishing finite-size effects. The probability that r^y splits 7'^*' 
is equal to the probability that both legs belong to 7^*^ which is equal to (A''-'/^(^))^- This 
heuristics assumes that macroscopic cycles are spread uniformly in space, so they are present 
in all mesoscopic boxes in proportion to their size, and also that the local configurations do 
not depend on the situation in other boxes. Note that short range correlations in the cycle 
structure affect only the probability that a randomly chosen pair x, y belongs to distinct legs, 
which is absorbed in the constant R that determines the time scale of the effective split merge 
process. 

In addition, we also conjecture that when a cycle is split, it is split uniformly. 

Conjecture 2 LetT such that VaoiT) > and define the CDF for a function of the split length 
aG[0,l], 



E 

xG7(') k=K 






(27) 



The 



lim 



|A|- 



A(|0,^'^)W-a|>£)=O. 



If these conjectures hold true, the effective process (A^^^(7r(i)), A^'^-'(7r(i)) . . ■)j>g with sta- 
tionary initial condition converges in the limit |A| — >■ 00 to a split-merge process as defined in 
((23)) and (|24)) , running with total rate R v^ . Therefore the distribution of cycle lengths has to 
be invariant with respect to the split-merge process, so it has to be Poisson-Dirichlet. We now 
check these conjectures numerically. 
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Figure 10: A single permutation has been chosen with respect to the equilibrium measure for [A] = 128, 
T = 0.8. The longest cycle 7'^' is spread everywhere. (Left) Scatter plot of every 500th site along 7''^'. 
The color indicates the distance along 7'^ , renornialized by the length of 7'^ , starting from the site 
closest to the origin. (Right) The box A of volume 128"^ has been partitioned in 64 subsets of volume 
32^. The histogram depicts the number of sites of 7'^' that can be found in each subset. It is essentially 
constant. 
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3.3 Numerical data about the effective split-merge process 

In this section wc give numerical evidence that the rates for sphtting and merging macro- 
scopic cycles converge to those of a split-merge process, thus confirming the Poisson-Dirichlet 
distribution of macroscopic cycles. The heuristics behind this argument, as described in the 
previous section, is that macroscopic cycles are distributed uniformly amongst lattice sites on 
a mesoscopic level, and not are confined to a bounded region. This is illustrated in Fig. [TU] 
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Figure 11: Expectations (left) and standard deviations (right) of the rates defined in (|28p . |A| = 
8^, 16'', 32"^, 64^, T = 0.8. The standard error for the mean is within the marker size. Averages were 
taken over 10^ realizations. 



In order to verify Conjecture [TJ the rates of merging the two longest cycles, splitting the 
longest, and splitting the second longest in the next timestep were calculated using (PS|) and 
dlSD. We define 



,w 



Rij (71-) 



riiTT) 



(A(''))2 



(28) 



Fig. [TT] shows that Vij and r^ converge to a constant R as expected, supporting Conjecture 
1. Note that in addition to convergence of the mean the variance is decreasing, confirming 
convergence in probability as stated in the conjecture. 
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Figure 12: Cycles split uniformly. Plot of Sj (tt) as defined in H27p for a typical permutation tv under 
the equilibrium measure with |A| — 64^, T — 0.8. 

If the cycle is split uniformly, 6*^ -^ (tt) as defined in Conjecture 2 is the CDF of a uniform 



15 



random variable on [0,1]. The graph of Bi" (tt) is shown in Fig. [T^for a given permutation tt 
chosen randomly from the equilibrium measure, which confirms the expected behaviour. 
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Figure 13: Expected value of the number of cycles longer than |A|°'^, EA,t(A''|A|), as a function of time. 
The Monte-Carlo chain starts from the identity permutation and has been recorded for various system 
sizes. Averages were taken over 10'* realizations. 



4 Further prospects 

The emergence of macroscopic cycles seems intriguing and it is worth being studied. Start- 
ing the Monte-Carlo Markov chain from the identity permutation, one expects the system to 
display only finite cycles for some time, before infinite objects built up. Let Ea,* denote the 
corresponding expectation, and A''|a| (tt) denote the number of cycles of length larger than | A|°'^. 
Fig. [T3l shows EA.t(Af|A|) for T = 0.8 and various volumes. There is a peak at the transition 
to the phase with macroscopic cycles. It is certainly due to the presence of many mesoscopic 
cycles for a short time, that are going to merge afterwards. It would be interesting to get plots 
for numbers of cycles larger than |A|° for a other than 0.6. 

While the fraction of sites in large cycles, EA,t(i'), varies continuously with time on the 
scale i/|A|, the steps (c) and (f) of Section [321 take place at a very high rate (proportional to 
|A|) once the phase with macroscopic cycles has been reached. One then expects the lengths 
of macroscopic cycles to split and merge so fast that the Poisson-Dirichlet distribution appears 
immediately. The numerical results of Fig. [T3] confirms this. Indeed, the expected lengths of 
the longest cycles, when divided by the number of sites in macroscopic cycles, is equal to the 
expected values obtained with respect to the Poisson-Dirichlet distribution, that can be found 
e.g. in US]. 

Finally, let us comment on the physical dimension, taken here to be d = 3. It is safe to bet 
that everything is similar in all dimensions greater than 3. On the other hand, the dimension 
d = 2 remains mysterious. There are certainly no macroscopic cycles, as was observed in 
|15j . An open question is whether a phase occurs where a positive fraction of points belong to 
mesoscopic cycles. This has been ruled out in the "annealed" model that involves averaging over 
point positions [SUH]. The present lattice model may be closer to a Bose gas with interactions, 
on the other hand, where a Kosterlitz-Thouless phase transition is expected. The presence of 
mesoscopic cycles could indeed be related to the slow decay of correlation functions. 
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Figure 14: The Poisson-Dirichlet distribution occurs already during equilibration, (a) Expected values 
of the fraction of sites in macroscopic cycles ¥.A,t{i^), and of the lengths of the four largest cycles (divided 
by the volume), (b) The expectation of the i-th longest cycle has been divided by the average length 
of the i-th part in a random partition with Poisson-Dirichlet distribution. It is always close to EA,t{i^)- 
|A| = 128'^, T = 0.8. Averages were taken over 10* realizations. 
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